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Abstract 

Although attribute grammars are commonly used for compiler con- 
struction, little investigation has been conducted on debugging at- 
tribute grammars. The paper proposes two types of systematic de- 
bugging methods, an algorithmic debugging and slice-based debug- 
ging, both tailored for attribute grammars. By means of query-based 
interaction with the developer, our debugging methods effectively nar- 
row the potential bug space in the attribute grammar description and 
eventually identify the incorrect attribution rule. We have incorpo- 
rated this technology in our visual debugging tool called Aki. 

1 Introduction 

The attribute grammar (AG) is a formal framework to express both syntax 
and semantics of programming languages [|J . An attribute grammar descrip- 
tion comprises a set of productions (BNF rules) and a set of attribution rules 
denned over the attributes associated with the grammar. 

Attribute grammars are easy to describe and understand because they de- 
scribe "what the programming language semantics is like" but not "how their 
attribution rules are actually implemented." An AG-based compiler- compiler 
takes an attribute grammar description of a programming language and gen- 
erates an efficient compiler for it. Attribute grammar has been successfully 
used to describe various programming languages and their processors. 
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On the other hand, debugging an attribute grammar is not simple. De- 
bugging an attribute grammar description using a standard debugger exposes 
the attribute grammar implementation such as the attribute evaluator, which 
is usually a program generated from the attribute grammar description, and 
the runtime representation of attributes and parse trees. 

The paper proposes AG-aware debugging techniques for attribute gram- 
mars. By "AG-aware" we mean that the debugger is aware of attribute gram- 
mars and thus debugging does not necessarily require knowledge about strat- 
egy and implementation of attribute evaluation. We have formerly applied 
an algorithmic debugging |l(J technique to attribute grammar ||. This paper 
is on the same line but further incorporates slice-based debugging technique 
as well. By means of query-based interaction with the compiler developer, 
both techniques effectively narrow the potential bug space in the attribute 
grammar description and eventually identify the incorrect attribution rule. 

The benefit of our approach, independence from the implementation of 
particular AG-based compiler-compiler, is twofold: (1) the compiler devel- 
oper is freed from understanding implementation of the compiler-compiler 
and (2) the proposed technique can be applied to other AG-based system. 

Our hybrid debugging technique has been implemented as a visual de- 
bugging environment called Aki. It is written in Squeak Smalltalk ||. The 
resulting system has been used in our project that incorporates AG technolo- 
gies in most phases of compiler construction — including transformation of 
intermediate code, optimization, and code generation ||. 

The rest of this paper is as follows. Section |2] briefly introduces the 
attribute grammar and attribute evaluation, sections || and £| explain the 
two debugging techniques, section |5] describes the visual debugger, section || 
discusses the debugger, and section |7| concludes this article. 



2 Attribute grammars 

The attribute grammar [|J is a description that provides both syntax and 
semantics of an input string. It is one of fundamental system used for for- 
malization and construction of compilers. 

Fig. |I] is an example of an AG description that evaluates binary represen- 
tation of numbers. An attribute evaluator generated from the AG description 
takes an input string (e.g., ".Oil") and parses it in accordance with the syntax 
definition of the grammar. Then the attribute evaluator computes attribute 
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F ::= . L 
{ L.pos = 1; 
F.val = L.val } 
L ::= B Li 

{ Li.pos = L Q .pos + 1; 
B.pos = L .pos + 1; 
L .val = B.val + L\.val } 

I B 
{ B.pos = L .pos; 

L .val = B.val } 

B ::= 1 

{ B.val = 2~ B - pos } 

{ B.val = } 



(bug) 



Figure 1: An example of attribute grammar 




val=0.125 



Figure 2: Attributed parse tree 
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values for each node in the parse tree according to the dependency among 
attributes and associates them to the respective node. In this manner, an 
attributed parse tree is created from the input string (see Fig. |2|). The shaded 
arrows represent dependency between attributes. 

There are two kinds of attributes. One is called inherited attribute, whose 
value is computed from the values of the attributes associated with the ances- 
tor and sibling nodes. The other is called synthesized attribute, whose value 
is computed from the values of the attributes associated with the children 
nodes. 

We have intentionally introduced a bug in the AG description for the sake 
of discussion in the following sections. Because of this, some of the attribute 
values in Fig. |2| are wrong. 



3 Application of algorithmic debugging to at- 
tribute grammar 



Algorithmic debugging [[T(| is a systematic bug locating technique. With 
programmer's guidance, an algorithmic debugger locates a bug in program 
execution. Algorithmic debugging formulates execution in terms of computa- 
tion tree which is defined as recursive logical deduction of logic programming, 
or recursive /^-reduction for functional programming. The debugging method 
has been applied to functional |7| or procedural language |lj . 

To apply algorithmic debugging to attribute grammar paradigm, we need 
to formulate attribute evaluation as some form of recursive application. In 
H we have shown that the notion of Synth function || suits for this purpose. 

It is known that for any synthesized attribute s of a node N in the parse 
tree, its value N.s is uniquely defined using the Synth function F N . S : 



N.s = F N , s (N.I N , s ,tree 



where tree^ stands for the subtree of the parse tree rooted at node N and 
N.In.s f° r a set °f inherited attributes of N on which N.s directly or indirectly 
depends in treejy @. 

Because the entire attribution for the parse tree can be represented as 
recursive application of Synth functions, we can use Synth functions as basis 
for formulating computation tree for attribute grammars. 

We will explain how the bug in Fig. [I] can be detected by the algorithmic 
debugging. When a string ".Oil" is given as an input to the description, 
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("<F F val , { tree= . } ,F. val=0.25> ) 
kFn vai , { L 1 .pos= 1 , tree=. . . } ,L 1 . val=0.25>) 

y — y 

fohi . va i , { B 1 .pos=2, tree=. . . } ,B 1 . val=0^ \ 

(a) (<F L2val ,{L2.pos=2, tree=... },L2.val=0.25>) 
(c) ^F B2 . va i,{B2.pos=3, tree- ..},B2.val=2 3 

( b) ^F L3 .vai,{L3.pos=3, tree- ..},L3.val=0.125j 
^F B 3.vai,{B3.pos=3, tree=...},B3.val=2 3 ^ 

Figure 3: Computation tree 



the computation tree in Fig. |3] is created. First the debugger chooses node 
(a) in this figure, and queries the user whether L 2 .val = 0.25 is correct 
with respect to the argument L 2 .pos and the subtree rooted at L 2 . This 
query means whether the binary number "11" from the second decimal place 
represents 0.25, and the user can respond that this is incorrect because the 
correct value is 0.375. Given this, the debugger prunes the computation tree 
above (a) from the search space. Then the debugger queries similar question 
for node (b). This time the value is correct and the subtree rooted at (b) is 
pruned. Next the debugger queries similar question for node (c). Finally the 
debugger locates the attribution rule for node (a) as containing a bug. 

As for the applicability of the method to classes of AG, || gives how to 
apply algorithmic debugging to noncircular AG, absolutely noncircular AG, 
simple multi-visit AG and to its subclasses. 



4 Slice-based debugging 

Techniques that utilize the notion of program slices have been applied to 
program verification, program testing, version management, and systematic 
program debugging. Shimomura proposed a systematic debugging method 



of procedural programming languages using slices [[□]]. We applied this idea 
to the attribute grammar framework. 
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Li.pos = 1 

L 2 .pos = Li.pos + ]g= 2 
B 2 .pos = L 2 .pos + 1 = 3 
B 2 .val = 2~ B2 -p° s = 2~ 3 
Li.pos = L 2 .pos + 1=3 
B 3 .pos = Li.pos = 3 
B 3 .val = 2~ B3 - pos =^2r 3 
L 3 .val = B 3 .val 
L 2 .val = L 3 .val + B 3 .val 



Figure 4: Slice partitioning 



The dynamic program slice of an execution of a statement s in a program 
is a set of all the statements upon which s depends, directly or indirectly 
[|5|. In attribute grammars, attribute evaluation of a given parse tree can 
be considered to be a sequence of evaluation of node attributes, and we can 
define dynamic program slices for a given attribute grammar description and 
its input program. For example, the sequence on the right side in Fig. |] is 
a possible attribute evaluation sequence of L 2 .val for AG in Fig. p] and the 
input ".Oil." The directed graph on the left side illustrates direct dependency 
among the attributes. The slice for a given attribute (instance) is a set of 
attributes upon which the attribute depends.^] For example, the slice for 
L 3 .pos is a set of attributes upon which L 3 .pos directly or indirectly depends, 
namely, L 2 .pos and Li.pos. 

Our debugging strategy works as follows. Suppose that we already know 
that a slice contains an incorrect attribution that is triggered by an unknown 
bug. The debugger partitions the attribute evaluation sequence into arbitrary 
two subsequences (e.g., s\ and s 2 in Fig. f|). Then the debugger queries the 
user about correctness of evaluation. This is accomplished by asking the 
correctness of the attribute values crossing the boundary between the two 



AG, the slice can be given for an attribute instance rather than for evaluation of 
each attribute, because each attribute is evaluated only once. 



Systematic Debugging of Attribute Grammars 



X B 




-- all ~ 

as yet unclassified 



instance 



class 



ruleEorAdd:Ej;P:EXP: 
ruleEor Assign : EXP : EXP : 
ruleForCmm : CFUNC : 
ruleForComp : LEXICAL : CSTM : 
ruleForEunc : LEXICAL : LEXICAL 
ruleEorlf : CMPOP : CSTM : CSTM : 
ruleEorNull: 

menaSaaaiB: 



ruleForlf : if CHPOP: cmp CSTM: stml CSTM: stm2 

<stml.table> *■ <if .table>. 
<stm2.table> <if .table>. 

<if.gen> *- DTest3 union: <stml.gen:> and: <stm2.gen>. 
<if.lcill> <- DTest3 intersection: <stml.kill> and: 
<stm2.kill>. 

<stml.in> *■ <if.in>. 
<stm2.in> *r <if .in>. 

<if.out> *■ DTest3 union: <stml.out> and: <stm2.out>. 




if 

nodeld -> 23 

table->Dictionary (vl->Set (43 
32 9 ) v2->Set (17 ) ) 
in->Set (9 17 ) 
out->Set (43 32 17 ) 



Figure 5: Example of execution of Aki 



subsequences (e.g., correctness of B 2 .val and L 2 .pos). If one of the crossing 
attribute values turns out to be incorrect, then the debugger identifies that 
the error was triggered in the subslice of that attribute. Otherwise, when 
all the crossing attribute values are correct, then the debugger excludes the 
subsequence s± from its bug-locating search space. 

For example in Fig. [|, value assignment to B2.VC1I, 2 -3 is different from its 
expected value, 2~ 2 (B 2 .val stands for the value of binary number (0.01)2). 
Given this information from the user, the debugger understands that the bug 
inhabits somewhere in the subslice of B 2 .val. Next, the debugger divides the 
subslice of B 2 .val and queries whether the value of L 2 .pos = 2 is correct. The 
user answers "yes". Then the debugger narrows the search space to B 2 .pos 
and B 2 .val, and queries whether the value of B 2 .pos = 3 is correct. Since the 
user says it is incorrect, the debugger can locate the bug to a single attribute 
B 2 .pos, and it identifies that the attribution rule u B 2 .pos = L 2 .pos + 1" 
contains a bug. 
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:1a; 



inherited attribute is 
pos = 2 
synthesized attribute is 
val = (1/4) 
correct? 

yes 
no 

do not know 



o2 



n ii 



bl 



Figure 6: Example of query of algorithmic debugging 




Figure 7: The screen shot when the bug is located 



5 Aki: the debugger 

The Aki visual debugger is a part of our AG-based compiler construction 
system that comprises a compiler frontend generator called Rie and backend 
generator called Jun||. Aki is used to locate bugs in the compiler backend 
description. The two systematic debugging techniques explained in earlier 
sections are incorporated in Aki. Because Jun accepts fairly large class of 
attribute grammars, this approach can be applied to other AG evaluators 

i- 

When the compiler developer finds that the compiler generates an incor- 
rect code sequence for some source program, then he/she can supply both 
the compiler backend description and the source program to Aki for debug- 
ging. Fig. |5| shows a screen shot of a debug session using Aki. The panes 
presents attribution rules, input source program supplied to the debugged 
compiler, values of attributes, and the parse tree of the source program in 
several forms. These panes work cooperatively: user's interaction with one 
pane is reflected to others. 

The user can choose systematic debugging strategies with buttons: A- 
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lass 



pos = 2 
correct? 
yes 



-debug step 



b2 



N 11 



bl 



Figure 8: Example of query of slice-based debugging 



debug button for algorithmic debugging and S-debug button for slice-based 
debugging. 

The algorithmic debugger chooses an arbitrary node from the computa- 
tion tree and ask the user about correctness of the respective computation. 
In Fig. ||, the debugger is asking about correctness of the computed attribute 
value ( u val = 1/4"). Aki helps the user answer this question by highlighting 
the respective subtree of the parse tree in dark-gray and showing inherited 
attribute values given as inputs to this computation ( ll pos = 2"). The user's 
answer to this question is used by the debugger to narrow the search space 
for erroneous code in the program. This narrowing is repeatedly applied until 
the debugger eventually locates the erroneous attribution rule (in Fig. ^, Aki 
successfully located a bug in ruleForl2 rule.). 

The user starts slice-based debugger pointing it an incorrect attribute 
value in the attributed parse tree. The slice-based debugger systematically 
locates the source of the problem in the attribution rules. The debugger 
computes the slice for the incorrect attribute and partition the slice into 
two subslices. Then the debugger queries if incorrect attribute values are 
passed from one subslice to another. This is accomplished by asking separate 
questions for these attribute values, respectively. In Fig. Aki is asking 
about correctness of the value of one of those attributes. Given answers to 
these questions, the debugger narrows the search space for erroneous code 
into one of the subslices and eventually locate the incorrect attribution rule. 

An advantage of slice-based debugging it can debug a program that failed 
to complete in the middle of its execution. On the other hand, it is impossible 
to create a computation tree for the entire computation of such program and 
hence difficult to apply algorithmic debugging. 
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algorithmic 
debugging 


slice-based 
debugging 


without 
Aki 


number of 
queries 


9 


13 


NA 


time to 
locate bugs 
(min.) 


3 


3 


15 



Figure 9: Result of a user test 

6 Experiments and discussion 

6.1 Evaluation of Aki 

Aki has been used to debug descriptions of several modules in our com- 
piler construction project for the C language; examples of modules are SSA- 
conversion and liveness analysis. Users report it is easier to find bugs by 
using Aki than by using conventional approach. 

We have done user test to evaluate usefulness of Aki. Three experienced 
compiler programmers are chosen from our team as subjects. They are shown 
a source program of some compiler module. We have included a typical 
mistake in the source program. Then the programmer is asked to locate using 
algorithmic debugging feature of Aki, using slice-based debugging feature of 
Aki, or without support by Aki. The 123 lines long test program contains 
18 attribution rules. Given a sample C program that triggers the compiler's 
bug, the parser generates a parse tree of 77 nodes and 171 attribute instances. 

Fig. H shows how the choice of debugging method affects the time to locate 
bugs. The numbers imply effectiveness of the systematic debugging approach 
for AG-based compiler construction, at least for smaller sized modules. 

6.2 Discussion 

A known problem in algorithmic and automatic debugging is that the user 
has to understand the question and answer it correctly (|l[ [fij 0). 

For example in the algorithmic debugging of AGs, the user may have 
difficulties in checking whether the behavior of a Synth function or the values 
of attributes are correct. This is due to the fact that the user has to look at 
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the subtree in the parameter of the Synth function, and furthermore attribute 
values may be a set and may have many elements in compilers. 

These problems have been partly solved from the algorithmic point of 
view by extending the debugging algorithm in several situations [8]. For 
example, the extended algorithm can deal with the case when the user find 
the value of the inherited attribute — given as a premise of a query by the 
system — is itself wrong, or when the user cannot reply to a query with 
confidence. Some part of the extended algorithm is implemented in Aki. 

On the other hand, from the implementation point of view, we made 
several efforts in making Aki so that the user can easily grasp the attribute 
values and the subtree in a query. For example, Aki can show the source 
code corresponding to any subtree. When a new attribute value is computed 
by combining several attribute values with some operation, Aki highlights 
the part of the original attribute values within the new attribute value by a 
separate color, making the difference of both attributes clear. 

However, we think further improvement in the algorithm and the imple- 
mentation is necessary. 

7 Conclusion 

This paper has presented two systematic debugging techniques for attribute 
grammars, one is based on the algorithmic debugging and the other is based 
on the program slicing. These techniques have been incorporated in our 
high-level visual debugging tool called Aki. 

There remain some issues that require further investigation. Currently the 
two systematic debugging features are provided as separate technologies. We 
plan to integrate the two techniques and search for more effective debugging 
methodology. 
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